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Abstract 

The O-acetylation of polysaccharides is a common modification used by pathogenic organisms to protect against external 
forces. Pseudomoncts aeruginosa secretes the anionic, O-acetylated exopolysaccharide alginate during chronic infection in 
the lungs of cystic fibrosis patients to form the major constituent of a protective biofilm matrix. Four proteins have been 
implicated in the O-acetylation of alginate, AlglJF and AlgX. To probe the biological function of AlgJ, we determined its 
structure to 1.83 A resolution. AlgJ is a SGNH hydrolase-like protein, which while structurally similar to the N-terminal 
domain of AlgX exhibits a distinctly different electrostatic surface potential. Consistent with other SGNH hydrolases, we 
identified a conserved catalytic triad composed of D190, H192 and S288 and demonstrated that AlgJ exhibits acetylesterase 
activity in vitro. Residues in the AlgJ signature motifs were found to form an extensive network of interactions that are 
critical for O-acetylation of alginate in vivo. Using two different electrospray ionization mass spectrometry (ESI-MS) assays we 
compared the abilities of AlgJ and AlgX to bind and acetylate alginate. Binding studies using defined length 
polymannuronic acid revealed that AlgJ exhibits either weak or no detectable polymer binding while AlgX binds 
polymannuronic acid specifically in a length-dependent manner. Additionally, AlgX was capable of utilizing the surrogate 
acetyl-donor 4-nitrophenyl acetate to catalyze the O-acetylation of polymannuronic acid. Our results, combined with 
previously published in vivo data, suggest that the annotated O-acetyltransferases AlgJ and AlgX have separate and distinct 
roles in O-acetylation. Our refined model for alginate acetylation places AlgX as the terminal acetlytransferase and provides 
a rationale for the variability in the number of proteins required for polysaccharide O-acetylation. 
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Author Summary 

Bacteria utilize many defense strategies to protect them- 
selves against external forces. One mechanism used by the 
bacterium Pseudomonas aeruginosa is the production of 
the long sugar polymer alginate. The bacteria use this 
polymer to form a biofilm - a barrier to protect against 
antibiotics and the host immune response. During its 
biosynthesis alginate undergoes a chemical modification 
whereby acetate is added to the polymer. Acetylation of 
alginate is important as this modification makes the 
bacterial biofilm less susceptible to recognition and 
clearance by the host immune system. In this paper we 
present the atomic structure of AlgJ; one of four proteins 
required for O-acetylation of the polymer. AlgJ is structur- 
ally similar to AlgX, which we have shown previously is 
also required for alginate acetylation. To understand why 
both enzymes are required for O-acetylation we function- 
ally characterized the proteins and found that although 
AlgJ exhibits acetylesterase activity - catalyzing the 
removal of acetyl groups from a surrogate substrate - it 
does not bind to short mannuornic acid polymers. In 
contrast, AlgX bound alginate in a length-dependent 
manner and was capable of transfering acetate from a 
surrogate substrate onto alginate. This has allowed us to 
not only understand how acetate is added to alginate, but 
increases our understanding of how acetate is added to 
other bacterial sugar polymers. 

Introduction 

Pseudomonas aeruginosa is an opportunistic, Gram-negative 
pathogen that can cause acute and chronic infections. The 
bacterium is the dominant bacterial species in the lungs of cystic 
fibrosis (CF) patients and if left untreated, is the leading cause of 
morbidity and mortality among these individuals [1]. P. 
aeruginosa is able to persist through the formation of a biofilm 
where communities of surface attached bacteria are encapsulated 
in a matrix composed primarily of secreted extracellular polysac- 
charides. Bacteria embedded within a biofilm are more resistant 
than their planktonic counterparts to environmental stresses such 
as antibiotics and disinfectants, and are able to evade the defense 
mechanism(s) of the host [2-7]. P. aeruginosa has the genetic 
capability to produce at least three different biofilm exopolysac- 
charides: Pel, Psl, and alginate [8,9] . P. aeruginosa clinical isolates 
obtained from CF patients with chronic pulmonary infections 
secrete large amounts of alginate [10,11]. This exopolysaccharide 
is synthesized in the cytoplasm and is translocated across the inner 
membrane as a linear homopolymer of D-mannuronic acid 
[12,13]. The polymer is subsequently modified in the periplasm 
through O-acetylation and epimerization to form a (314 linked 
non-repeating chain of D-mannuronic acid and its C5 epimer L- 
guluronic acid [14,15]. 

Modification of polysaccharides through the addition or 
removal of acetate is an important biological process for survival 
and virulence in many bacterial species. For example, biofilm 
formation by the human pathogens Escherichia coli, Staphylococ- 
cus aureus and Staphylococcus epidermidis, requires the partial de- 
iV-acetylation of the exopolysaccharide poly-R-l,6-A^-acetyl-D- 
glucosamine (PNAG) [16-19]. Similarly, deacetylation of Pel from 
P. aeruginosa is required for Pel-dependent biofilm formation [20] 
while deacetylation of the holdfast polysaccharide synthesized by 
Caulohacter crescentus is required for adhesion and cohesion [21]. 
In each case the deacetylation or removal of acetate from the 
acetylated polysaccharide requires a single enzyme, which has 



been shown to be a member of carbohydrate esterase family 4 
(CE4). In comparison, O-acetylation of polysaccharides is a 
complex process requiring two enzyme functionalities. The first 
functionality is the transfer of an acetyl-donor into the periplasm, 
which is hypothesized to be catalyzed by a membrane bound O- 
acetyltransferase (MBOAT) [22]. The second functionality, 
catalyzed by a periplasmic O-acetyltransferase, transfers acetate 
onto the polysaccharide. There are currently three distinct systems 
that are differentiated by the number of proteins required for O- 
acetylation; a single protein system whereby both functionalities 
are encoded on one polypeptide, a two-protein system with one 
MBOAT and one periplasmic O-acetyltransferase and, in the case 
of alginate and cellulose, a four protein system comprised of an 
MBOAT protein, two periplasmic O-acetyltransferase and a 
protein of unknown function [22,23]. 

The O-acetylation of the C6 hydroxyl of muramoyl residues in 
peptidoglycan is critical in many human bacterial pathogens 
including; Methicillin-resistant S. aureus (MSRA), Bacillus 
anthracis, Neisseria meningitides and Neisseria gonorrhoeae 
[24,25] as it confers resistance to degradation by endogenous 
autolysins and the host immune system during infection [26]. A 
single integral membrane O-acetylpeptidoglycan transferase (Oat) 
is utilized in Gram-positive bacteria, while Gram-negative bacteria 
utilize a two-protein system composed of PatA, the MBOAT, and 
PatB, the periplasmic O-acetyltransferase [27-29]. The O- 
acetylation of alginate in P. aeruginosa, is an important 
modification as acetylated alginate is less susceptible to recognition 
and clearance by the host immune system than its non-O- 
acetylated counterpart [30] . Similar to the acetylation of cellulose 
by Pseudomonas fluorescens through the combined action of 
WssG, WssH, WssI and WssF, the O-acetylation of alginate 
requires four proteins; AlgF, AJgl, AlgJ and AlgX [12,13,31-33]. 
Acetylation of alginate can occur at the C2 and C3 hydroxyl 
groups of mannuronic acid residues. Algl is predicted to be a 
member of the MBOAT family, while AlgJ and AlgX are required 
in the O-acetylation of alginate as the polymer passages through 
the periplasm [13,31,33-35]. The function of AlgF, which is not 
predicted to have a catalytic domain, is currently unknown. It is 
also unclear why alginate acetylation requires two active O- 
acetyltransferases. 

To probe the role of AlgX in alginate acetylation, we recendy 
determined its structure and found that it is a two-domain protein 
with an N-terminal SGNH hydrolase-like domain and a C- 
terminal carbohydrate-binding module (CBM) [34]. Our in vivo 
functional characterization of AlgX demonstrated that three 
catalytic residues, D174, HI 76 and S269, located in the active 
site are required for alginate O-acetylation. In the present study, 
we sought to delineate the role of AlgJ, and examine why both 
AlgJ and AlgX are required for alginate acetylation [13]. To this 
end, we have determined the structure of Pseudomonas putida 
AlgJ 75 _ 370 (PpAlgJ 7 5_ 370 ) to 1.83 A resolution and have function- 
ally characterized the protein. 

Results 

AlgJ contains an SGNH hydrolase-like core 

The TMHMM server v2.0 indicates that AlgJ from P. 
aeruginosa PAOl possesses a transmembrane helix from residues 
12-29 that tethers the periplasmic domain to the cytoplasmic 
membrane [36]; a prediction that is supported by in vivo 
localization studies [13]. To probe the function of AlgJ in alginate 
biosynthesis, we attempted to crystallize a soluble domain, residues 
79-379, of P. aeruginosa AlgJ (PaAlgJ 79 _ 379 ). Although this 
protein was recalcitrant to crystallization, we were able to 
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crystallize the orthologous domain from P. putida, P^AlgJ 7 5_ 370 , 
which shares 83% and 53% sequence similarity and identity to 
PflAlgJ 79 _ 379 , respectively. Selenomethionine-incorporated (Se- 
Met) protein was expressed, purified, crystallized, and the 
structure determined to 1.83 A using the single-wavelength 
anomalous dispersion (SAD) technique (Table 1). P^AlgJ 75 _3 70 
crystallized in space group C2 with two molecules in the 
asymmetric unit. After iterative rounds of model building and 
refinement, the structure yielded models with an i? wor k and R{ TCe of 
17.5% and 21.2%, respectively (Table 1). Analytical size exclusion 
chromatography suggests that AlgJ is a monomer in solution (data 
not shown). The dimer observed in the crystal structure is 
therefore not believed to be biologically relevant. The two 



Table 1. Summary of data collection and refinement 
statistics. 




Data Collection Statistics 


Beamline 


NSLS X29 


Wavelength (A) 


0.979 


Temperature (K) 


100 


Space Group 


C2 


Cell dimensions 


a, b, c (A) 


192.5, 34.0, 82.9 


a, 11, y O 


90.0, 91.9, 90.0 


Total no. of reflections 


358026 


No. of unique reflections 


48465 


Resolution (A) 


50.00 - 1.81 (1.90 - 1.81)" 


Average l/at 


16.0 (2.9) 


Completeness (%) 


98.8 (96.2) 


Redundancy 


7.5 (5.8) 


a b 

"merge 


10.1 (56.8) 


Refinement Statistics 


Rwork/Rfree (%)' 


17.6/21.5 


No. atoms 


Protein 


4482 


Water 


282 


Average B-factor (A 2 ) 


Protein 


39.4 


Water 


43.8 


Rms deviations 


Bond lengths (A) 


1.41 


Bond angles ( ) 


0.014 


Ramachandran plot d 


Total favoured (%} 


98.2 


Total allowed (%) 


1.8 


Coordinate error (A) e 


0.16 


PDB Code 


408V 



a Values in parentheses correspond to highest resolution shell. 
b fimerge = £ £ |l(k)-<l>|/E l(k) where l(k) and <l> represent the intensity 
values of the individual measurements and the corresponding mean values. The 
summation is over all unique measurements. 

c ^work-£ I Fobs| — k|Fcalc||/|Fobs| where Fobs and Fcalc are the observed and 
calculated structure factors, respectively. R free is the sum extended over a subset 
of reflections (4.2%) excluded from all stages of the refinement. 
d As calculated using MOLPROBITY [82]. 

e Maximum-Likelihood based coordinate error as determined by PHENIX [64]. 
doi:10.1371/journal.ppat.1004334.t001 



molecules in the asymmetric unit superimpose well with a root 
mean square deviations (RMSD) of 0.19 A over the 906 aligned 
backbone atoms. Due to the poor quality of the electron density 
we were unable to build residues N75 to G77 and A266 to K278 
in both molecules, and residues R78 and L242 in molecule A, and 
P244 and L245 in molecule B. Given the absence of two residues 
in the loop between L242 and F246 in molecule B, all structural 
analyses were performed using molecule A. All structural features 
defined using molecule A were also present in molecule B with no 
significant deviations. 

The SGNH hydrolase superfamily consists of enzymes with 
varying hydrolytic activities (e.g., proteases and lipases) [37]. 
SGNH hydrolases have an ot/p/ot fold with each of the four 
conserved active site residues responsible for catalysis residing in 
one of four conserved blocks. The structure of P^AlgJ 75 _ 370 reveals 
that the protein has an a/ p/ a fold with a core of four parallel P- 
strands (Figure 1: P3, P6-8) and an isolated P-bridge at P3, which 
are comparable to the five parallel P-strands found in canonical 
SGNH hydrolases [37,38]. The core P-strands are surrounded by 
nine ot-helices (Figure 1: ocl, 0(3— 10) that complete the ot/p/ot fold. 
A surface representation of P/>AlgJ 7 5_ 370 reveals a shallow groove 
that crosses the face of the protein (Figure 2A, left, centre). 
Examination of the electrostatic surface potential shows a distinct 
region of electronegativity on the face of the protein located within 
and below the shallow groove (Figure 2B, centre). 

Previous in vivo complementation experiments have suggested 
that AlgJ functions as an O-acetyltransferase and that this activity 
is dependent on conserved residues D193 and HI 95 (P. 
aeruginosa numbering) [33] . In P^AlgJ the corresponding residues 
D 1 90 and H 1 92 are located in the shallow electronegative groove 
(Figure 2B, centre). Residues within the groove are well conserved 
in AlgJ homologs from six Pseudomonas sp. (P. putida, P. 
aeruginosa, P. syringae, P. prolegens, P. entomophila, and P. 
alkylphenolia) and Azobacter vinelandii (Figure 2A, centre). A 90° 
rotation along the horizontal axis relative to the groove depicts a 
generally electroneutral surface (Figure 2B, right) that also 
contains highly conserved residues (Figure 2 A, right). 

A search for structurally similar proteins using DALI reveals 
that the core of AlgJ is most similar to the N- terminal domain of P. 
aeruginosa AlgX, residues 44-344 (PflAlgX 4 4_ 3 4 4 ), with an RMSD 
of 2.06 A over 165 aligned Cot-atoms (Figure 3A). The active site 
of PpAlgJ 75 _ 370 and the orientation of the putative catalytic triad 
D190, HI 92 and S288, is analogous to PaAlgX 27 _ 47 4 and other 
serine esterases and proteases (Figure 3B) [39,40]. Similar to 
PfflAlgX 27 _4 7 4, the structure of PpAlgJ 75 _ 37 o reveals several 
differences relative to canonical SGNH hydrolases [32]. Firstly, 
block III that contains the conserved asparagine residue that forms 
part of the oxyanion hole is absent (Figure 3C). Examination of 
the P/>AlgJ 7 5_ 370 structure suggests that Y348 is in a position to act 
as a hydrogen bond donor in place of the conserved asparagine 
(Figure 3B). In PflAlgX 27 _ 47 4 a tyrosine residue, Y328, also 
occupies this position. P^AlgJ 75 _ 370 also deviates from the 
canonical GDSL(S) and DxxH motifs in blocks I and V, 
respectively (Figure 3C). AlgJ has a GTSYS motif in block I 
which contains the catalytic serine (S288), while a single spacer 
residue in block V separates the two remaining catalytic triad 
residues (D190 and HI 92) to form a DxH motif. AlgJ appears to 
be a circularly permuted member of SGNH hydrolases as the 
order of conserved residues in primary sequence (H-S-G-Y) is 
different than that of SGNH hydrolases (S-G-N-H). However, the 
overall three dimensional shape and fold are structurally similar 
despite the rearrangement of conserved residues. PpAlgJ 75 _ 370 also 
contains secondary structure features not present across SGNH 
superfamily members. Two long anti-parallel P-strands are present 
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Figure 1 . Structure and topology of PpMgJ 7S ^ 70 . (A) Cartoon representation of PpAlgJ 75 _ 37 o with secondary structural elements labelled (a: a- 
helix; p: p-strand; and t: 3 10 helix). Residues at discontinuous points in the structure due to poor observed electron density are labelled and coloured 
red. The N- and C-termini of the protein are labelled N and C, respectively, and the terminal residue is coloured red. (B) Topology model of the 
PpAlgJ 75 _ 370 structure with secondary structural elements and termini labelled as in panel (A). 
doi:1 0.1 371/journal.ppat.1 004334.g001 



on one side of the protein ((34 and (35, Figure 1), along with eight 
3io helices (tl-8) and a small cap domain above the proposed 
active site which consists of two short anti-parallel (3-strands ((31- 
2), five 3 10 helices (tl-3, t5-6) and one a-helfx (oc2) (Figure 1). 
Despite these differences, -P^AlgJ 7 5_ 370 is structurally comparable 
to other SGNH hydrolases including E. coli thioesterase I (TAP), 
and Aspergillus aculeatus Rha [38] as the SGNH hydrolase 



domains align with RMSDs of 3.2 A and 3.8 A over 101 and 104 
equivalent Cot residues, respectively. 

The AlgJ signature motifs 

Two conserved sequence motifs, termed the AlgJ signature 
motifs, have been defined in P. aeruginosa AlgJ and its homologs 
[33]. These motifs are characterized by conserved regions of 
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Variable Average Conserved 




90° 







Figure 2. Surface representations of PpMgJ 7S ^ 370 and its comparison to AlgX 44 _344. (A) Surface representation of PpAlgJ 75 370 with residue 
conservation of AlgJ homologs from 6 different Pseudomonas spp. and A. vinelandii. Conservation is displayed from magenta (highly conserved) to 
cyan (variable). Surface residue conservation was analyzed using ConSurf [71]. The dashed circle indicates the region where the shallow groove was 
identified. (B) and (C) Electrostatic surface representation of PpAlgJ 75 _ 370 and PaAlgX44_ 3 44, respectively. Electrostatic surface potentials were 
calculated using the APBS Tools 2.1 plugin within PyMOL [70]. The electrostatic surfaces are displayed in both panels from -7 kT/e (red) to +7 kT/e 
(blue). 

doi:1 0.1 371 /journal.ppat.1 004334.g002 



OOOPxK (<1> represents any hydrophobic residue), and (R/ 
K)TDTHW. Mutation of residues within these signature motifs 
leads to impairment or ablation of alginate O-acetylation [33]. 
Utilizing the structure of P/;AlgJ 7 5_ 370 , we identified the location of 
the signature motif residues (Figure 4A). Two distinct types of 
intramolecular interaction networks are observed within the 
motifs, which localize to the cap domain. The cap domain sits 
atop the SGNH hydrolase-like core that typically contains the 
active site and catalytic residues of canonical SGNH hydrolases 
[37,38]. The first network composed of residues; K134, T189, 
D190 and H192 form a hydrogen-bonding network with L187, 
D254 and S288 (Figure 4B) in PpAlgJ 75 _ 370 . It was previously 
observed that alginate O-acetylation was abrogated in vivo for the 
Pa AlgJ variants K137A, D193A and HI 95 A [33]. Superposition 
of P^AlgJ 75 _ 370 with AlgX 44 _ 3 44 suggests that D190 and HI 92 
(D193 and HI 95 in PflAlgJ 79 _ 379 ) form part of the catalytic triad 
(Figure 3). Thus, two of the three catalytic triad residues (aspartic 
acid and histidine) in AlgJ reside in the separate cap domain 
distinct from the ot/pVoc fold, yet occupy an equivalent spatial 
position in the active site to their SGNH counterparts. The 
location of D 193 and HI 95 in PaAlgJ (D190 and HI 92 in P/jAlgJ) 



and the ablation of acetylation in vivo for the D193A and H195A 
variants provide further evidence of their crucial role in the 
catalytic mechanism of the protein. The second intramolecular 
interaction network is composed of a series of hydrophobic 
interactions centered around the conserved W193, with V131, 
Y289, W295 and F297 (Figure 4C). W193 is completely buried 
and comprises part of the hydrophobic core of AlgJ. 

The Ser-His-Asp triad is responsible for acetylesterase 
activity In vitro 

To determine whether AlgJ is catalytically active we examined 
the ability of the enzyme to exhibit O-acetylesterase activity, a 
commonality among SGNH hydrolases and the first half of the 
acetyltransferase reaction. PflAlgJ 79 _ 379 and P/jAlgJ 75 _ 37 o were 
both assayed to demonstrate that the enzymes are functionally 
equivalent. Using the substrate 3-carboxyumbelliferyl acetate, the 
kinetic parameters for PrtAlgJ 79 _ 379 and PpAlgJ 75 _ 370 were 
observed to be comparable (Table 2), with a 2-fold difference in 
K m . The k cat /K m obtained for AlgX, which has been previously 
demonstrated to catalyze this reaction, differed by only 3-fold 
compared to the AlgJ orthologs. To assess the function of putative 
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Figure 3. Architecture of the proposed PpAlgJ 75 _ 370 active site compared to AlgX. (A) Superposition of the cartoon representations of 
PpAlgJ 75 _37o (blue) and PaAlgX 44 _344 (grey). The conserved residues implicated in catalysis in SGNH hydrolases are displayed as sticks and coloured 
either red (PpAlgJ) or grey (AlgX). The dashed box indicates the region where the conserved residues are located. (B) Enlarged figure highlighting the 
similarity of the residues in the active site region in both proteins. Red stick residues and text identifies the conserved residues from PpAlgJ whereas 
the grey sticks and black text identifies residues from AlgX. Yellow dashed lines represent hydrogen bonding between members of the catalytic triad 
with values in angstroms (A). (C) Sequence alignment of various members of the SGNH hydrolase superfamily depicting four consensus blocks shared 
by members. PpAlgJ: P. aeruginosa O-acetyltransferase (Q88ND3), PaAlgX: P. aeruginosa O-acetyltransferase (Q51372) EstA: Lactobacillus helveticus 
arylesterase (Q9LAH7), EtpA: Vibrio mimicus arylesterase (Q07792). Conserved residues are masked in blue (absolutely conserved) or green (conserved 
in most members). Residues masked in purple correspond to the conserved tyrosine in AlgJ and AlgX that are proposed to be analogous in function 
to asparagine in other superfamily members. Brackets correspond to Uniprot identifiers. 
doi:1 0.1 371 /journal.ppat.1 004334.g003 



catalytic residues; D190, HI 92 and S288 in PpAlgJ and D193, 
HI 95 and S297 in PaAlgJ were substituted with alanine. 
Interestingly, while catalytic alanine variants in AlgX result in 
the abrogation of 3-carboxyumbellifyl acetate hydrolysis [32], 
mutation of the catalytic triad in both AlgJ orthologs only reduced 
the catalytic activity by ~80% (Figure 5). Circular dichroism 
spectroscopy of the AlgJ orthologs and their respective variants 
exhibited no significant difference in spectra (data not shown). 
This indicates that the protein variants are properly folded and 
that the differences in catalytic activity are not due to large 
structural perturbations. 



AlgX but not AlgJ interacts with mannuronic acid oligomers 

Comparison of P^Algj75_37o and AlgX 44 _344 indicates a signi- 
ficant difference in electrostatic surface potential between the 
active site regions of the enzymes (Figure 2B and C). PpAlgJ 75 _ 3 7 0 
contains a shallow electronegative groove architecture around the 
active site whereas AlgX 44 _ 344 contains a deep electropositive groove 
compatible with binding the anionic alginate orpolymannuronic 
acid polymer. In addition, AlgX has a C-terminal carbohydrate- 
binding module, which presumably aids in binding and guiding 
alginate either to or from the active site. To examine whether 
PflAlgj7g_ 3 79 and AlgX 2 7^74 interact with mannuronic acid 
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Figure 4. The AlgJ signature motifs. (A) Cartoon representation of PpAlgJ 75 _ 370 with residues from the signature motif represented as sticks. The 
AlgJ signature motif residues have been coloured according to the amount of alginate acetylation observed in vivo in each alanine variant relative to 
WT AlgJ. Yellow represents impairment of between 50-60% acetylation; orange represents strong impairment with only 6-25% acetylation observed; 
and red represents ablation or between 0-5% observed acetylation. Residues represented as grey sticks are proposed to be involved in interactions 
with the AlgJ signature motif residues but have not been characterized in vivo. (B) Selected residues from the box area in panel A that participate in 
or are proposed to be involved in the hydrogen bonding network and are coloured as described in (A). Hydrogen-bonds are depicted as green 
dashes with distance given in angstroms (A). (C) Residues participating in hydrophobic and van der Waals interactions with W193 (shown in orange) 
are depicted in grey. Hydrophobic and van der Waals interactions less than 3.7 A are depicted with blue dashes. 
doi:1 0.1 371 /journal.ppat.1 004334.g004 



oligomers, a direct electrospray ionization mass spectrometry (ESI- 
MS) binding assay was carried out using nine mannuronic acid 
oligosaccharides ranging from 4— 1 2 sugar units in length (ManAj - 
ManAi 2 ). Representative ESI mass spectra acquired for aqueous 
ammonium acetate solutions of AlgX and the protein reference scFv 
(P rc f) with ManA 6 or ManAi 2 are shown in Figures 6A and 6B, 
respectively. In the mass spectrum shown in Figure 6A, ion signals 
corresponding to protonated AlgX monomer and protonated 1:1 
(AlgX+ManAfi) complex, the +13 to +16 charge states, are 
observed. Signals corresponding to protonated P„.f and (P re f+ 
ManAg or ManA 12 ) are also present, indicating that nonspecific 
carbohydrate-protein binding took place during the ESI process. 
The mass spectrum shown in Figure 6B is qualitatively similar to 
that shown in Figure 6A, although a visibly larger fraction of the 
protein is in the bound form. Similar results were obtained for the 
other alginate ligands tested (data not shown). ESI-MS measure- 
ments were also performed on solutions of AlgX and undeca- and 
pentadeca-hyaluronic acid (HAn and HA 15 ). Importantly, these 
negative controls revealed no evidence of specific binding between 
AlgX and these acidic oligosaccharides, thereby confirming the 
specificity of AlgX for the alginate oligomers rather than any acidic 



oligosaccharide. Listed in Table 3 are the association constants (K.^) 
determined by ESI-MS, following correction for nonspecific 
carbohydrate-protein binding. Notably, the K. A values for AlgX 
are seen to clearly increase with the length of the mannuronic acid 
oligosaccharides. It must be noted that the location of ligand 
binding site cannot be explicitiy identified using this methodology 
and the observed interaction for AlgX may include both ligand- 
active site and ligand-CBM interactions. 

The same assay was utilized to quantify binding of PfflAlgJ 79 _ 
3 79 to the mannuronic acid oligomers. Representative ESI mass 
spectra are shown in Figures 6C and D for ManA 6 and 
ManA 12 , respectively. Analysis of the ESI mass spectra reveals 
two distinct charge state distributions (+10 to +13 and +14 to + 
26) for the protonated ions of AlgJ. Ions corresponding to 1:1 
complexes of AlgJ with alginate ligand were only observed at the 
lower charge states. Taken together, these results suggest that a 
fraction of AlgJ is unfolded in solution (corresponds to the +14 
to +26 charge state distribution) and does not bind to the 
oligosaccharides [41-45]. Only the lower charge state AlgJ ions 
were considered for the K a determinations (Table 3). Because 
the relative protein ion abundances measured by ESI-MS do not 



Table 2. Summary of kinetic parameters 3 . 







Kn ImMI 


A„ t <x10 3 s" 1 ) 


(M-'-S- 1 ) 


PaAlgJ 79 _ 379 


1.16±0.120 


3.44±0.129 


2.96±0.324 


PpAlgJ 75 _ 370 


2.30±0.0963 


7.80±0.155 


3.39±0.0688 


PaAlgX 27 _ 474 


1.10±0.0960 


11. 4 ±0.343 


10.3±0.961 



a Standard reactions contained variable concentrations of 3-carboxyumbelliferyl acetate (ACC), dissolved in DMSO ranging between 0.1 K m and >2 K m and 30 (ig each 
protein in a total volume of 100 U.L in 50 mM sodium HEPES buffer (pH 7.0 for AlgJ, pH 8.0 for AlgX) at 298 K. 
doi:1 0.1 371 /journal.ppat.1 004334.t002 
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Figure 5. Specific activities of wild-type and catalytic triad mutants of PpMgl 7 s-37o and PnMql y9 - 379 . Specific activities were measured at 
an initial concentration of 3.0 mM ACC. WT and specific mutants for each AlgJ construct are labelled below the x-axis. Specific activity upon the y-axis 
was measured as umol min -1 mg^ 1 . 
doi:1 0.1 371 /journal.ppat.1 004334.g005 



necessarily reflect relative concentrations in solution, the 
concentration of folded AlgJ used for the K. A calculations was 
not accurately known. Nevertheless, the ESI-MS results suggest 
that under the conditions tested binding of AlgJ to the 
mannuronic acid oligomers is extremely weak, with K a values 
of less than 500 M" 1 . 

AlgX is able to O-acetylate alginate in vitro 

Since AlgX interacts with mannuronic acid oligomers under the 
conditions tested, we examined the ability of AlgX to transfer an 
acetyl group from the pseudo-acetyl donors, 4-nitrophenyl acetate 
and 3-carboxyumbelliferyl acetate, to a decamer of polymannuro- 
nic acid (ManAio). Reactions analyzed by ESI-MS revealed the 
production of both mono- and di-acetylated mannuronic acid 
oligomers in the AlgX containing reactions as observed by an m/ z 
shift of 42.01 for the sodiated singly acetylated and 42.01 for the 
protonated doubly acetylated. The mass increase of the protonated 
singly acetylated species was 42.00 (Figure 7). Control reactions 
containing all of the reaction components except AlgX, did not 
result in acetylated alginate even though 4-nitrophenyl acetate 
exhibited low spontaneous hydrolysis at pH 7.0 resulting in the 



production of 4-nitrophenyl and acetate. Neutral, commercially 
available sugars, cellohexose, xylohexose and maltotriose were not 
O-acetylated in the presence of AlgX. While both AlgJ and AlgX 
are necessary for alginate O-acetylation in vivo [13,33] our 
binding and acetyltransferase data suggest AlgJ and AlgX do not 
have overlapping functions. Our current in vitro data provide 
additional evidence to support previous in vivo studies that 
there is no redundancy in the alginate acetylation machinery 
[31-33]. 

Discussion 

The O-acetylation of alginate requires the concerted action of 
four proteins: the putative MBOAT protein, Algl; a protein of 
unknown function, AlgF; and two annotated O-acetyltransferases 
AlgJ and AlgX. To shed light on the role of each O-acetyltrans- 
ferase, we determined the structure of P^AlgJ 75 _ 37() and compared 
it with the recently solved structure of PaAlgX 27 _474. 
Additionally, we kinetically characterized the acetylesterase activity 
of AlgJ and tested the ability of AlgJ and AlgX to bind and O- 
acetylate short polymannuronic acid oligomers. 
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Figure 6. ESI mass spectra to examine polymannuronic acid binding. ESI mass spectra acquired for aqueous ammonium acetate (50 mM) 
solutions of (A) PaAlgX 27 ^, 74 (P) (4 |xM) and 50 mM ManA 6 ); (B) PaAlgX 2M74 (P) (4 \M) and 50 mM ManA 12 ); (C) PaAlgJ 79 _ 379 (P) (10 ^M) and 50 mM 
ManA 6 ); (D) PaAlgJ 79 _ 379 (P) (1 0 jiM) and 50 mM ManA 12 ). A reference protein scFv (4 |iM) was added into the solutions of AlgX, Lyz was added as P ref 
to AlgJ solutions. Ion peaks corresponding to higher charge states of AlgJ are labelled as filled red circles. P ref refers to the reference protein scFv. 
doi:10.1371/journal.ppat.1004334.g006 



The structure oi PpAlg] 7 -^ 7a reveals a fold that is structurally 
comparable to AlgX4 4 _ 34 4 and other SGNH hydrolases. P^AlgJ, 
like AlgX, is best described as an SGNH hydrolase-like protein as 
they both exhibit several key differences to canonical SGNH 
hydrolases. Not only is the order of the catalytic residues circularly 
permuted, but both proteins contain a cap domain and two long 
anti-parallel P-strands on one side of the protein that are not 
observed in other SGNH members. Although the function of the 
two long anti-parallel P-strands is currently unknown, given the 
involvement of other proteins in the O-acetylation system it is 
tempting to speculate that these regions may be involved in 
protein-protein interactions. A prediction for potential interaction 
surfaces on AlgJ was made using the consensus Protein-Protein 
Interaction Site Predictor (cons-PPISP) [46,47]. Two predicted 



clusters of residues had positive scores for a possible interface. One 
cluster is comprised of residues 353-363 and is located on a loop 
and the beginning of alO on the C-terminal end of the protein. A 
second cluster comprised of residues P79, G80, V81, D235 and 
F239 in PpMg\ is localized to the cap domain, located above the 
SGNH core. This cluster contains the conserved AlgJ signature 
motif residues that are imperative for alginate O-acetylation. The 
structure of PpMgj has allowed us to confidendy define the 
location of these residues, and propose roles for their function in 
O-acetylation. Signature residue variants in PaAlgJ; P135A, 
K137A, D193A, H195A and W196F {PpMgj equivalents; P132, 
K134, D190, H192 and W193) ablate O-acetylation [33]. The 
structure of Pp AlgJ, in addition to our kinetic data, indicates that 
D 1 90 and H 1 92 are part of the catalytic triad. Therefore, alanine 



Table 3. Apparent association constants {K a ) for PaAlgX 27 -474 and PaAlgJ 7 g_ 37 9 for short polymannuronic oligosaccharides at 298 K 
and pH 7 determined by the direct ESI-MS assay. 





Length of oligosaccharide (Ligand name)* 


Apparent K a (M -1 ) AlgX 


Apparent K a (M -1 ) AlgJ 


4 (ManA 4 ) 


(1.0±0.5)x10 3 


<500 


5 (ManA 5 ) 


(1.0±0.5)x10 3 


<500 


6 (ManA 6 ) 


(3.2±0.3)x10 3 


<500 


7 (ManA 7 ) 


(4.8±1.0)x10 3 


<500 


8 (ManA 8 ) 


(5.8±1.0)x10 3 


NB 


9 (ManA 9 ) 


(8.8±1.0)x10 3 


NB 


10 (ManA 10 ) 


(10.0±0.7)x10 3 


NB 


11(ManA„) 


(16.0±0.5)x10 3 


NB 


12 (ManA 12 ) 


(19.0±0.3)x10 3 


NB 



NB: No Binding. 

*Ligand name as referenced in [78]. 
doi:10.1371/joumal.ppat.1004334.t003 
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Figure 7. AlgX 2 7-474 is an O-acetyltransferase. Mass spectra of the acetyltransferase reactions in the presence (red, line a) and absence (blue, 
line b) of PaAlgX 2 7_47 4 . Major peaks are labelled: ManA10, mannuronic acid decamer; ManA10+Na, mannuronic acid decamer plus sodium; ManA10+ 
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adds 42.01 Da to the mass of the polymer. The data has been deconvoluted and centroided using Agilent MassHunter software and represents 
uncharged exact masses. 
doi:1 0.1 371 /journal.ppat.1 004334.g007 



variants D190A and H192A would disrupt the proposed catalytic 
serine by increasing the pK a of the nucleophile and altering its 
orientation in the active site. The relatively rare internal lysine, 
K134 uses hydrogen bonding to stabilize the backbone oxygens of 
L187, R188 and D190 that form a loop between helices oc 3 and 0C4. 
It is expected that the loss of hydrogen bonding in the K134A 
variant disrupts the proper positioning of D 190 in the DxH motif 
required for catalysis. The P132A variant located proximal to 
K134 would alter the secondary structure of, and impede proper 
function of K134. Lastly, the W193A variant is anticipated to 
disrupt the hydrophobic interactions with its neighbouring 
residues. Given that W193 is located proximal to catalytic 
HI 92, structural perturbations caused by disruption to the 
hydrophobic interactions are expected to have a negative impact 
on the catalytic triad. 

Our in vitro enzymatic analysis probed the ability of PaAlgJ 
and P^AlgJ to perform the initial acetylesterase step of the overall 
acetyltransferase reaction. The results indicate that both AlgJ 
enzymes exhibit comparable catalytic parameters to AlgX for the 
hydrolysis of acetate (acetylesterase) from 3-carboxyumbelliferyl 
acetate [32]. Catalytic variants reduced acetylesterase activity by 
>80%. Although, the complete loss of activity was observed when 
similar catalytic residues were replaced in AlgX, in vitro residual 
activity in catalytic variants has been reported in at least one 
SGNH esterase [40]. The precise reason for residual activity is not 
clear since there are limited in vitro studies characterizing catalytic 
triad variants in SGNH hydrolase superfamily members. Howev- 
er, one possibility is the surrogate acetyl-donor tested does not 
optimally mimic the native acetyl donor of AlgJ. Residual 



hydrolysis of esterase substrates may also be attributed to surface 
amino acid residues that form microenvironments that promote 
spontaneous, non-enzymatic hydrolysis. For example, non-enzy- 
matic hydrolysis has been reported for the SGNH superfamily 
member glutathione-S-transferase [48] and in the non-enzymatic 
protein albumin [49,50]. The observation that the catalytic triad 
variants ablate O-acetylation in vivo further supports the notion 
that non-specific hydrolysis occurs in vitro. 

The electrostatic surfaces of P/»AlgJ 7 5_ 3 7 0 and the SGNH 
hydrolase-like domain of PaAlgX 2 7_474 are distinctly different with 
respect to the active site region. PflAlgX contains an electropos- 
itive groove that stretches from the active site to the C-terminal 
CBM [32]. We previously proposed that this highly conserved 
electropositive region would be compatible for binding the anionic 
alginate polymer. A direct ESI-MS binding assay confirmed that 
AlgX is able to bind mannuronic acid oligomers, with longer, 
more physiologically relevant polymers of mannuronic acid 
exhibiting higher affinity for the protein. The addition of a 
surrogate acetyl-donor in the presence of mannuronic acid 
oligomers led to the first demonstration in vitro that AlgX is an 
O-acetyltransferase capable of transferring acetyl groups to 
mannuronic acid oligomers. While ESI-MS cannot explicidy 
identify the location of carbohydrate-protein interaction, the 
ability for AlgX to O-acetylate mannuronic acid oligomers 
demonstrates that these oligosaccharides must, in part, bind to 
the active site. In comparison, P^AlgJ demonstrated very weak or 
no affinity toward mannuronic acid oligosaccharides under the 
conditions tested. Taken together, these data suggest that AlgX 
may be the only enzyme that O-acetylates alginate and that it 
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functions in a non-redundant, successive mechanism with the 
other proteins in the acetylation complex machinery. Our data 
allow us to propose an updated model for alginate O-acetylation 
(Figure 8). Briefly, Algl interacts with an unknown acetyl donor 
molecule in the cytoplasm and transfers acetate or acetyl donor 
across the inner membrane to either AlgJ and/or AlgF in the 
periplasm. The transmembrane domain of AlgJ (residues 12-29) 
tethers the enzyme to the membrane and potentially closer to Algl 
than either AlgF or AlgX which lack this spatial constraint. Given 
the current state of knowledge, the function and mechanism of the 
intermediate step(s) involving AlgJ or AlgF can only be speculated. 
The order of transfer of the acetyl donor is uncertain, but the fact 
that AlgJ can perform the initial esterase stage of transferase 
reaction suggests that, given the right donor, recipient, and 
environment, this enzyme is directly involved in the O-acetylation 
process. This possibility cannot be ruled out for AlgF either, but 
the lack of identifiable catalytic residues in this protein suggests 
that it may serve a more accessory role. However, it is clear that 
AlgX can catalyze the direct O-acetylation of alginate. Thus, it is 
conceivable that AlgX receives the acetate or acetyl donor 
molecule from either AlgJ or AlgF, and then subsequendy 
catalyzes the transfer of this substrate to O-acetylate alginate. 
This model is supported by the observation that active site variants 
of either AlgJ or AlgX are sufficient to abolish acetylation in vivo 
[32,33]. In addition, the data presented herein exclude the 
possibility that both enzymes can O-acetylate alginate. 

Although the O-acetylation of polysaccharides requires a 
minimum of two functionalities, O-acetylation of alginate and 
cellulose requires four distinct proteins including a membrane 
associated and non-membrane associated periplasmic O-acetyl- 
transferase. In contrast, the O-acetylation of peptidoglycan utilizes 
a single O-acetyltransferase that may not be constrained to the 
inner membrane depending on the presence of an N-terminal 
transmembrane domain. Regardless of the number of proteins 
involved in the system, the catalytic mechanism of acetate 
transfers across the inner membrane and the acetylation of the 
polymer is undoubtedly highly conserved. Additionally, the role 
of O-acetylation of these polysaccharides in both Gram-negative 
and Gram-positive bacteria is functionally similar, allowing for 
protection against external agents [16-21,26,27]. Intuitively, the 
requirement of four proteins involved in relaying acetate from 
the cytosol to the polysaccharide appears inefficient when 
compared to the O-acetylation of peptidoglycan. While addi- 
tional proteins could facilitate increased regulation, the extent of 
O-acetylation of peptidoglycan and alginate between species, 
strain, and culture conditions, varies between 20—70% (relative 
to muramic acid content) and 4-57%, respectively [51,52]. This 
indicates that the amount of O-acetylation cannot be simply 
categorized based on the number of proteins involved or their 
localization. 

A defining characteristic of cellulose and alginate biosynthesis 
compared to peptidoglycan is that these polysaccharides must 
traverse two membranes for export before reaching their final 
location, which in turn requires the involvement of several 
additional proteins in polymer modification and export. There- 
fore, four O-acetylation proteins may be an inherent requirement 
to adapt the O-acetylation machinery to the biosynthetic export 
machinery. In support of this, AlgX has been demonstrated to 
interact with both outer membrane alginate export proteins and 
periplasmic proteins required in alginate modification [53-57]. 
Such interactions have not been observed in the O-acetylation of 
peptidoglycan. The proteins WssFGHI are compulsory for 
cellulose acetylation [58] and WssGHI are homologous to AlgFIJ, 
with amino acid sequence identities of 24, 46, and 33%, 



respectively. We have previously suggested that WssF, which is 
predicted to belong to the SGNH hydrolase superfamily may be 
analogous to AlgX although it lacks the CBM present in AlgX 
[32]. Since proteins involved in cellulose acetylation have not been 
studied at either the structural or functional level, our present 
studies on AlgJ and AlgX are significant as they provide further 
data to support the roles of these proteins beyond sequence 
homology and phenotypic analysis. 

We have successfully determined the structure of AlgJ, a protein 
involved in the alginate O-acetylation pathway. AlgJ exhibits 
acetylesterase activity that is mediated through the Ser-His-Asp 
catalytic triad similar to that of the SGNH hydrolase-like enzyme 
AlgX. ESI-MS confirmed that AlgX but not AlgJ binds 
polymannuronic acid and we have conclusively demonstrated 
that AlgX is an O-acetyltransferase, and that it is the only 
annotated O-acetyltransferase in the pathway that can both 
interact with, and O-acetylate alginate. Refining the model for 
alginate O-acetylation provides new avenues for further studies 
into the mechanism of O-acetylation of polysaccharides in the 
microbial kingdom. 

Materials and Methods 

Chemicals, bacterial strains, plasmids, and growth media 

Superflow Ni 2+ NTA -agarose resin was obtained from Qiagen 
(Mississauga, ON). Graphitized carbon solid phase extraction 
columns (Carbograph SPE) are products of Grace Canada, Inc 
(Ajax, ON). All other chemicals and reagents, unless otherwise 
stated, were supplied by Sigma-Aldrich Canada Ltd. (Oakville, 
ON). All growth media was obtained from Bio Basic (Markham, 
ON). DNA manipulations were performed in E. coli DH5ot and 
protein expression of the SeMet protein was carried out using E. 
coli B834 (DE3) Met-auxotroph cells and grown in media 
supplemented with kanamycin at 50 |J,g mL -1 . Protein expression 
for binding and enzyme assays was carried out in E. coli BL21- 
CodonPlus cells. 

DNA manipulations 

The nucleotide sequences ofalgj from P. putida KT2440 and 
P. aeruginosa PAOl were acquired from the Pseudomonas 
Genome Database [59]. The boundaries of the putative 0- 
acetyltransferase domain of P. aeruginosa AlgJ were predicted to 
range from amino acids 79 to 379 (PaAlgJ 79 _ :i79 ) based on Phyre 2 
[60] structural alignments using the N-terminal domain of P. 
aeruginosa AlgX (PfflAlgX 44 _ 344 ) [32]. Sequence alignment 
revealed that these boundaries correspond to amino acids 75 to 
370 in the P. pulida AlgJ homolog (P^AlgJ 75 _ 370 ). PCR 
amplification was carried out using the high fidelity DNA 
polymerase, PfuTurbo (Stratagene). FastDigest restriction enzymes 
were obtained from Fermentas. Plasmid DNA was extracted from 
E. coli using the PureLink Quick plasmid miniprep kit from 
Invitrogen (Burlington, ON). Primer sequences used in the 
generation of wild type (WT) PaAlgJ 79 _ 379 and PfflAlgX 44 _ 344 , as 
well as proposed active site mutants are summarized in 
Supplementary Table SI. The PCR reaction conditions were as 
follows: Pfu buffer with 2 mM MgS0 4 (Thermo Scientific), 10 ng 
template DNA, 10 ng forward and reverse primer each, 25 u,M 
dNTPs, 2 mM MgS0 4 , 2.5 U Pfu polymerase (Thermo Scientific) 
in a total reaction volume of 25 uE. The PCR product was 
digested with Ndel and Xhol and ligated into a pET28a vector 
backbone to generate (i) a thrombin cleavable N-terminal hexahis- 
tidine tag (His 6 ) construct; and (ii) a second construct used for 
crystallography that contained both an N and C-terminal His 6 -tag. 
Site directed mutagenesis was performed using the QuikChange 
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Figure 8. Proposed model for the O-acetylation of alginate. Algl receives the currently unidentified acetyl donor molecule on its cytoplasmic 
face and shuttles the acetate or acetyl donor molecule through the inner membrane, where it is transferred to either AlgJ and/or AlgF. The acetyl 
donor may or may not be transferred between AlgJ and AlgF. From its intermediate location, the acetyl donor molecule is transferred to AlgX, which 
subsequently O-acetylates the alginate polymer. Although absolutely required for alginate O-acetylation the function of AlgF is currently unknown. 
We have demonstrated that AlgJ has activity on acetylated substrates thereby suggesting it fulfills a more direct role in the O-acetylation process. 
doi:1 0.1 371 /journal.ppat.1 004334.g008 



Lightning kit according to the prescribed protocol (Agilent Technol- 
ogies). Constructs generated were verified by sequencing performed 
by ACGT DNA Technologies Corporation (Toronto, ON). 

Expression and purification of AlgJ 

The expression and purification of native wild type and mutant 
PflAlgJ 7 9_ 3 79 and P/AlgJ 7 5_ 370 constructs were identical. SeMet 
P/>AlgJ 75 _ 370 was used for structure determination and was 
expressed using the protocol described by Lee et al [61]. The 
expression and purification protocol was as follows: Starter 
cultures were grown overnight in 50 mL Luria-Bertani (LB) broth 
containing 50 (Jig mL 1 kanamycin at 310 K in a shaking 
incubator, with E. coli BL21-CodonPlus cells transformed with 



the appropriate plasmid. The cells were subsequendy inoculated 
into 1 L LB broth containing 50 ug mL -1 kanamycin at 310 K in 
a shaking incubator. Upon reaching an OD 600 of 0.7, the cells 
were induced with isopropyl [S-D- 1 -thiogalactopyranoside (IPTG) 
to a final concentration of 1 mM. The induced cells were allowed 
to grow for an additional 1 8 h at 29 1 K. The cells were harvested 
via centrifugation at 5000 x g for 20 min at 277 K. The cell pellet 
was stored at 253 K until needed. Frozen cell pellet was thawed 
over ice and re-suspended in 50 mL cold lysis buffer (500 mM 
NaCl, 20 mM Tris-HCl pH 8.0) containing one SIGMAFAST 
EDTA-free protease-inhibitor cocktail tablet (Sigma). The cells 
were homogenized at 10,000 psi through an Emulsiflex C3 
(Avestin Inc.) with at least 3 passes until uniform in consistency. 
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The resultant cell lysate was centrifuged at 25000 x g for 
25 minutes at 278 K to pellet cell debris and insoluble material. 
The soluble cell lysate was loaded onto a 5 mL Ni 2+ -NTA gravity 
column equilibrated with Ni-NTA buffer (500 mM NaCl, 20 mM 
Tris-HCl pH 8.0, 5 mM imidazole). The column was washed with 
10 column volumes of Ni-NTA buffer containing 30 mM 
imidazole. Protein bound to the column was eluted with 4 column 
volumes of Ni-NTA buffer containing 150 mM imidazole. The 
eluent was dialyzed against 4 L of S200 buffer (150 mM NaCl, 
20 mM Tris-HCl pH 8.0). Protein concentration was measured 
using the Pierce BCA Protein Assay Kit from Thermo Scientific 
(Rockford, IL). The Hise-tag was cleaved from the protein via 
incubation with thrombin (Novagen) at 0.5 U per mg of protein at 
298 K for at least 2 h. The thrombin treated protein was loaded 
onto a 1 mL Ni 2+ -NTA column equilibrated with S200 buffer 
containing 5 mM imidazole. The column was washed with 10 
column volumes of S200 buffer containing 30 mM imidazole. The 
initial flow through and wash were pooled together and contained 
the un-tagged protein. The untagged protein was concentrated to 
a 1-2 mL volume using an Amicon Ultra centrifugation filter 
device (Milipore) with a 30 kDa cutoff. Approximately 20 mg of 
purified protein could be obtained per litre of bacterial culture. 
The concentrated protein was further purified via size exclusion 
chromatography on a HiLoad 16/60 Superdex 200 gel filtration 
column (GE Healthcare). Fractions containing protein were 
pooled and analyzed via SDS-PAGE to be >95% pure. 
PaAlgJ 7 9_ 3 79 and P^AlgJ 7 5_. i70 protein could be stored at 277 K 
for up to 2 or 4 weeks, respectively, before significant degradation 
was observed by SDS-PAGE. 

Crystallization and structure determination of PpAlgJ 75 _37 0 

SeMet PpAlgJ 75 _ 370 was concentrated to ~6-8 mg mL~ 1 by an 
Amicon Ultrafiltration device (30 kDa MWCO, Milipore) for 
crystallization trials. Sparse-matrix screens were setup by hand 
using MCSG suites 1-4 (Microlytic) in 48-well VDX plates 
(Hampton Research). The drops consisted of a 1:1 ratio of protein 
to well solution at a final volume of 4 uL equilibrated over 250 uL 
of well solution, and stored at 293 K. Numerous hits were 
obtained after 1 week and were primarily found in conditions 
containing a divalent cation (Mg + or Ca 2+ ) and polyethylene 
glycol (PEG) solutions between 3350-6000 Da. In most cases the 
crystals nucleated from a single point and radiated outwards 
forming a cluster with individual crystals estimated to be a 
maximum of 500 urn, and were of diffraction quality direcdy from 
the sparse matrix screens. 

Crystals used for data collection were found in MCSG-1, 
condition 7 (0.2 M MgCl 2 , 0.1 M Bis-Tris:HCl pH 5.5, 25% (w/ 
v) PEG3350). Prior to data collection, crystals were cryo-protected 
by exchanging the drop solution with cryo-protectant solution 
(0.2 M MgCl 2 , 0.1 M Bis-Tris:HCl pH 5.5, 25% (w/v) PEG3350, 
20% (v/v) ethylene glycol). The exchange was performed through 
the addition of cryo-protectant solution directly to the drop and 
the removal of the added volume until complete exchange had 
occurred. The crystal clusters were disrupted via physical contact 
with a fine needle until an isolated single crystal could be looped. 
Crystals were vitrified in liquid nitrogen and stored. Selenium 
single-wavelength anomalous dispersion (Se-SAD) X-ray diffrac- 
tion data were collected on beamline X29A at the National 
Synchrotron Light Source (NSLS) at Brookhaven National 
Laboratory. 90 images at 94% beam attenuation with 2° A(p 
oscillation and 360 images without beam attenuation with 1° A(p 
oscillation were collected on an ADSC Q315 CCD detector at a 
260 mm crystal-to-detector distance and 0.4 s exposure time per 
image. The Se-SAD data was indexed, integrated, scaled and 



merged using HKL-2000 (Table 1) [62], and used in conjunction 
with HKL2MAP to locate 12 (of 14) selenium sites, with density 
modified phases calculated using SOLVE/RESOLVE [63]. The 
electron density maps were of sufficient quality for automatic 
model building using PHENIX AutoBuild [64] and subsequent 
manual model building using COOT [65,66]. Model refinement 
was performed using PHENIX.REFINE [67] and progress was 
monitored as a function of the reduction and convergence of P wor k 
and -Rf ree (Table 1) [66] . TLS groups were added to the refinement 
in PHENIX through the use of the TLSMD server [68]. 

Structural and bioinformatics analysis of AlgJ 

All figures that display the structure of P/AlgT 7 r,_ :i70 and/ or 
PaAlgX 27 ^ 74 were generated using PyMol (The PyMol Molecular 
Graphics System, version 1.6.0.0, Schrodinger, LLC). The sec- 
ondary structure was determined using the STRIDE web server [69] . 
Surface representations demonstrating either electrostatics or surface 
residue conservation were depicted with all side chains present even if 
they could not be accurately modeled in the struc- 
ture. Electrostatic surfaces were generated using the ABPS Tools 
2.1 plugin that is integrated into PyMol [70]. Surface residue 
conservation was determined using the ConSurf web server [71] and 
visualized using the provided color scheme. Solvent accessible surface 
area was calculated using the PDBePISA server [72] . Surface residue 
conservation was determined using the ConSurf web server [71] and 
visualized using the provided color scheme. Solvent accessible surface 
area was calculated using the PDBePISA server [72]. 

Sequence alignments to determine AlgJ homologs were 
determined using BLAST [73]. Multiple sequence alignments 
were generated using Multiple Sequence Comparison by Log- 
Expectation (MUSCLE) [74,75]. Prediction of potential 
protein-protein interaction surfaces was performed using cons- 
PPISP [46,47]. 

Enzyme assay 

All enzyme assays were performed at least in triplicate, in a 96- 
well microtiter plate, using a SpectraMax M2 from Molecular 
Devices (Sunnyvale, CA). Standard reactions contained 3.0 mM 3- 
carboxyumbelliferyl acetate (ACC), dissolved in DMSO for specific 
activity assays and variable concentrations ranging between 0. 1 K m 
and &2 K m and 30 ug each wild-type protein and 100 ug for each 
variant in a total volume of 100 uL in 50 mM sodium HEPES 
buffer (pH 7.0 for AlgJ, pH 8.0 for AlgX) at 298 K. The final 
DMSO concentration did not exceed 10% (v/v). Due to low 
substrate solubility, reactions with higher substrate concentrations 
could not be obtained. Reactions were initiated by the addition of 
substrate and reactions were monitored in real time for a duration of 
10 min using an excitation of 386 nm and an emission of 447 nm as 
previously described [32]. The hydrolysis and release of acetate 
results in an increase in the fluorescence signal. Background 
hydrolysis rates, in the absence of enzyme, were monitored and 
subtracted from enzyme-catalyzed reactions. A calibration curve for 
7-hydroxycourmarin-3-carboxylic acid, the fluorescent hydrolysis 
product of 3-carboxyumbelliferyl acetate, was obtained under the 
reaction conditions and used to calculate reaction rate. The protein 
concentration of each enzyme variant was determined using the 
Pierce BCA Protein Assay Kit from Thermo Scientific (Rockford, 
IL). Data were fit by nonlinear regression to the Michealis-Menten 
equation using GraphPad Prism 6.0c for Mac, (GraphPad Software, 
La Jolla California USA, www.graphpad.com). 

Alginate binding assay 

Immediately prior to ESI-MS analysis, PfflAlgX 27 ^j. 74 and 
P«AlgJ 7 9_ 379 were each dialyzed against aqueous 100 mM ammo- 
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nium acetate (pH 7.0) using microconcentrators (MiJlipore Corp., 
Bedford, MA) with a MW cut-off of 30 kDa (for AlgX) and 10 kDa 
(for AlgJ). Two different reference proteins (P re f) were used to correct 
ESI mass spectra for the occurrence of nonspecific carbohydrate- 
protein binding (during the ESI process): a single chain fragment 
(scFv, MW 26 539 Da) of the monoclonal antibody (mAb) Se 155-4, 
which was produced and purified as described previously [76], and 
lysozyme (Lyz, MW 14 300 Da), which was obtained from Sigma- 
Aldrich Canada (Oakville, Canada) and used without further 
purification. Each protein was concentrated and dialyzed against 
50 niM ammonium acetate using microconcentrators (Millipore 
Corp., Bedford, MA) with a MW cut-off of 10 kDa and stored at — 
20°C until needed. Stock solutions of each of the polymannurnic acid 
oligomers, tetramer through dodecamer (ManAj - ManA 12 ), and 
undeca- and pentadeca-hyaluronic acid saccharides (HA n and 
HA 15 ), synthesized as previously described [77,78] were prepared by 
dissolving known amounts of solid compound in ultrafiltered water 
(Milli-Qj Millipore, Bedford, MA) to give a final concentration of 
~1 mM. The ligand solutions were stored at 253 K until needed. 

Affinity measurements were carried out on a Synapt G2 
quadrupole-ion mobility separation-time of flight (QJMS-TOF) 
mass spectrometer (Waters, UK) equipped with a modified 
nanoflow ESI (nanoESI) source. Complete details of the instru- 
mental and experimental conditions used for the ESI-MS binding 
measurements along with descriptions of how the data were 
analyzed to establish association constants (K a ) for protein-ligand 
interactions have been described previously [79-81]. 

O-Acetyltransferase assay 

O-Acetyltransferase assays were completed in a similar fashion 
as those previously described for the O-acetylation of peptidogly- 
can with modifications [51]. Briefly, 3 mM 4-nitrophenyl acetate 
dissolved in ethanol (3% final concentration), and 1 mM of 
ManA 10 [78], was incubated with 60 ug of AlgX in 50 mM 
sodium phosphate buffer, pH 7.0 for a duration of 1 h at 298 K. 
Control reactions containing 1 mM of cellohexose, xylohexose 
and maltotriose in place of alginate were also completed. All 
reactions were repeated in the absence of AlgX as a negative 
control. The total reaction volume was 150 |J,L and the reaction 
was initiated by the addition of protein. The reactions were 
quenched by applying the entire reaction to a 4 mL graphitized 
carbon solid phase extraction column previously washed with 
three column volumes of 100% acetonitrile containing 0.1% (v/v) 
trifluoroacetic acid (TFA) and then equilibrated with three column 
volumes of water. Following application of the reaction, the 
column was washed successively with three column volumes of 
100% acetonitrile, 100% acetonitrile containing 0.1% (v/v) TFA, 
3:1 isopropyl alcohol: acetonitrile (v/v). Polymannuronic acids 
were eluted with 6 mL of 50% tetrohydrofuran containing 0.1% 
TFA (v/v) and dried under vacuum in a Speed Vac at 298 K. The 
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dried samples were resuspended in 100 uL of water and stored at 
253 K prior to analysis. 

Liquid chromatography-mass spectrometry analyses were 
performed on an Agilent 1200 HPLC liquid chromatograph 
interfaced with an Agilent UHD 6530 Q;Tof mass spectrometer at 
the Mass Spectrometry Facility of the Advanced Analysis Centre, 
University of Guelph. A C18 column (Agilent Poroshell 120, 
150 mmx4.6 mm 2.7 urn) was used for chromatographic separa- 
tion with solution A (0.1% formic acid) and solution B (100% 
acetonitrile with 0. 1 % formic acid). The mobile phase gradient 
was as follows: initial conditions, 2% solution B increasing to 98% 
solution B in 30 min followed by a column wash at 98% solution B 
and 10 minute re-equilibration. The first 2 and last 5 minutes of 
gradient were sent to waste. The flow rate was maintained at 
0.1 mL/min. The mass spectrometer electrospray capillary 
voltage was maintained at 4.0 kV and the drying gas temperature 
at 523 K with a flow rate of 8 L/min. Nebulizer pressure was 
30 psi and the fragmentor was set to 160. Nitrogen was used as 
both the nebulizing and drying gas, and the collision-induced gas. 
The mass-to-charge ratio was scanned across the m/z range of 
100-3000 m/z in 4 GHz (extended dynamic range positive-ion 
auto MS/MS mode). Three precursor ions per cycle were selected 
for fragmentation. The instrument was externally calibrated with 
the ESI TuneMix (Agilent). The sample injection volume was 
10 JU.1. The resultant spectra were analyzed using mMass software 
(http : / / www. mmass . org/) . 

Accession numbers 

The coordinates and structure factors for -P^AlgJy-^yr, have 
been deposited in the PDB, ID code 408V. 
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